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1.0  SUMMARY 


In  this  project  two  parallel  tasks  are  merged  in  to  a  single  deoxyribonucleic  acid, 
DNA,  computing  architecture.  A  new  approach  to  aqueous  computing,  where  modified 
nucleotides  are  used  to  ‘write’  on  double- stranded  DNA  molecules  and  a  novel  '2-D' 
method  for  DNA  array  construction  are  developed.  The  aqueous  side  is  where  the 
computing  happens  and  the  array  side  is  where  the  results  are  decoded. 

On  the  aqueous  side,  binding  proteins  specific  to  each  modification  are  used  to 
selectively  isolate  DNA  fragments  and  this  selectivity  is  used  to  do  computation.  As 
there  are  millions  of  molecules  with  corresponding  binding  proteins,  this  approach  has 
the  potential  to  yield  unlimited  computing  power  as  compared  with  other  aqueous 
computing  methods. 

On  the  array  side,  current  genomic  DNA  array  fabrication  methods  are 
impractical  for  information  processing  applications  because  they  are  expensive,  difficult 
to  produce  for  universal  applications  and  lack  a  storage  capacity  that  can  rival  silicon 
based  systems.  However,  all  DNA  arrays  have  parallel  nucleic  acid  detection  capabilities 
and  this  model  can  be  used  to  support  DNA  based  information  processing.  To  go  beyond 
the  current  DNA  array  fabrication  methods,  a  new  method  of  DNA  array  construction 
that  uses  both  the  DNA  melting  temperature  and  DNA  secondary  structure  dimensions 
was  developed. 
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2.0  INTRODUCTION 


2.1  Modified  Nucleotides 

The  successful  use  of  molecular  biology  methods  as  computational  tools  by 
Adelman  [1]  has  led  to  the  development  of  a  variety  of  DNA  computing  techniques. 
Aqueous  computing  as  proposed  by  Head  and  Gal  [2]  uses  large  numbers  of  initially 
identical  molecules,  such  as  DNA,  which  serve  as  memory  registers  in  a  fluid 
environment.  Bit  values  (e.g.,  0,  1)  can  be  “written”  on  the  molecule  and  subsequently 
“read”  to  determine  solutions  to  computational  problems.  A  major  advantage  of  aqueous 
computing  is  that  the  fluid  memory  can  be  proportioned  out  and  mixed  back  together  so 
that  problems  requiring  an  exponential  number  of  steps  to  solve  conventionally,  involve 
only  a  linear  number  of  steps  in  this  method. 

Here  a  new  approach  to  aqueous  computing  where  the  “writing”  step  involves  the 
labeling  of  DNA  with  different  molecules  incorporated  into  nucleotides  is  investigated. 
These  modified  nucleotides  are  used  to  selectively  isolate  DNA  fragments  by  binding 
proteins  specific  to  the  label  of  interest.  An  implementation  strategy  for  using  these 
binding  proteins  in  aqueous  computing  is  given  with  the  successful  incorporation  of  4 
different  modified  nucleotides  into  DNA  and  isolation  of  DNA  labeled  with  two  of  these 
modified  nucleotides  using  binding  proteins..  The  commercial  availability  of  modified 
nucleotides  and  corresponding  binding  proteins  allows  for  larger  computations  while  the 
ease  of  incorporation  speeds  up  computation  time. 

2.2  DNA  Arrays 

DNA  microarrays  have  sizable  parallel  nucleic  acid  detection  capabilities  and  this 
model  can  be  used  to  support  DNA  based  information  processing  [3],  [4].  However, 
current  arrays  are  expensive  and  don't  have  an  information  capacity  that  can  rival  silicon 
based  systems.  Here  a  novel  method  of  DNA  array  construction  that  uses  both  the 
melting  temperature  and  the  secondary  structure  dimensions  is  explored. 
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This  method  of  construction  consists  of  separating  up  to  ~  10  different  50-mer 
strands  in  a  2-D  matrix  that,  by  generating  a  hybridization  signature,  is  capable  of 
recalling  or  monitoring  any  input  DNA.  Thus  the  2-D  array  can  be  thought  of  as  a  DNA 
memory.  Moreover,  because  of  the  ability  to  profile  synthetic  or  genomic  DNA,  the  2-D 
matrix  has  numerous  potential  applications  beyond  information  processing  in  the  fields  of 
bio-defense  monitoring  and  medical  diagnostics. 

Current  photolithographic  or  ink-jet  technologies  for  the  manufacture  of  DNA 
arrays  have  common  drawbacks.  One  major  impediment  for  information  processing 
applications  is  that  DNA  sequences  in  the  array  must  be  chosen  and  isolated  in  advance 
of  manufacture,  thus  even  under  automation,  constructing  an  array  with  more  that  10^ 
memory  locations  is  impractical.  Therefore,  using  current  microarray  technologies,  it 
seems  impossible  to  build  universal  microarray  with  an  information  density  that  can  rival 
silicon.  The  2-D  universal  DNA  matrix  model  attempts  to  address  this  problem.  It  is  a 
membrane  with  large  number  of  oligonucleotides  that  become  immobilized  on  the  surface 
in  a  self  assembled  process  that  is  reproducible.  It  is  constructed  by  a  method  by  which 
DNA  sequences  self-separate  on  the  surface  directed  by  their  physical  properties  [5]-[7]. 
Such  a  2-D  matrix  is  capable  of  monitoring  or  profiling  the  existence  of  any  DNA  input 
by  outputting  a  unique  hybridization  signature  that  corresponds  to  unique  pattern  that  can 
be  mapped  to  the  unique  locations  of  the  probes  on  the  membrane.  The  fact  that  10^^ 
strands  can  be  uniquely  separated  can  be  exploited  to  solve  complex  mathematical 
problems  like  SAT  [1]. 
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Due  to  the  high  sensitivity  and  eapaeity  of  the  2-D  matrix,  it  generates  signatures 
of  outputs  of  massively  parallel  DNA  eompute  operations.  These  signatures  ean  be  fed  to 
a  silieon  eomputer  for  analysis.  On  the  surfaee  this  may  seem  redundant;  why  not  just 
given  the  entire  problem  to  a  silieon  eomputer  from  the  start?  The  answer  is  that  the  2-D 
matrix  ean  reduee  an  exponential  problem  to  a  polynomial  one.  The  DNA  eompute 
information  obtained  is  based  upon  the  reeognition  of  patterns  of  hybridizations  to  the 
huge  DNA  eode  immobilized  in  the  2-D  matrix.  The  2-D  matrix  ean't  give  the  answer 
eompletely,  but  it  ean  organize  and  sort  massive  amounts  of  data,  preproeessing 
information  and  thus  making  it  feasible  for  silieon  eomputer  to  solve  the  problem. 


4 


3.0  METHODS,  ASSUMPTIONS,  PROCEDURES 


3.1  Encoding  Strategy  to  Incorporate  Modifications  into  DNA  Strands 

The  plasmid  DNA  pBluescript  was  used  as  the  starting  hardware  [8].  Labeled 
Polymerase  Chain  Reaetion,  PCR,  produet  was  derived  from  the  amplifieation  of  the 
approximately  200  base  pair  multiple  eloning  site  of  this  plasmid  with  eombinations  of 
four  different  modifieations:  Alexa  Fluor-488,  BODIPY-FL,  biotin,  and  digoxigenin 
(DIG).  The  writing  step  eorresponds  to  ineorporation  of  the  modified  nueleotides  or 
primers  using  PCR.  (See  Figure  1.) 


IN 


In  Figure  1,  primers  (P)  and  nueleotides  (N)  eontaining  speeifie  ehemieal 
modifieations  ean  be  used  in  the  PCR  to  ereate  DNA  fragments  with  those  ehemieal 
modifieations  as  shown.  Speeifie  ehemieal  modifieations  inelude  fluoreseent  eompounds 
sueh  as  Alexa  Fluor-488  and  Bodipy-  FL  or  non-fluoreseent  ones  sueh  as  digoxigenin 
(DIG)  and  biotin.  These  four  ehemieal  modifieations  have  sueeessfully  been  ineorporated 
into  DNA  fragments. 
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3.2  Coupling  the  Aqueous  Method  and  the  2-D  Array  to  Solve  SAT 

The  2-D  DNA  array  can  be  used  to  solve  the  satisfiability  (SAT)  problem  in  the 
following  way.  Given  the  conjunction  Cj  a  C2  a  . . .  a  of  disjunctive  clauses  Ci  over  n 
Boolean  variables,  the  SAT  problem  is  to  decide  if  there  is  a  Boolean  string, 
X j ,  X2 , . . . ,  ,  for  which  the  conjunction  Cj  a  C2  a  . . .  a  is  true.  This  problem  is 

known  to  be  NP  complete  on  a  silicon  computer.  However,  with  k-copies  of  the 
universal  2-D  array,  SAT  can  be  solved  without  having  to  use  the  many  step  mix  and 
split  method  of  [1].  After  DNA  discovery  of  the  solution  to  a  given  clause  Ci  by  the 
aqueous  method  (given  in  Section  4,)  the  DNA  solutions  to  each  clause  can  be  dispersed 
on  its  own  2-D  DNA  array  and  the  patterns  formed  on  each  array  can  be  uploaded  to  a 
computer.  The  computer  checks  for  common  features  of  each  pattern.  This  is 
mathematically  equivalent  to  forming  the  intersection  of  the  solution  set  to  each  clause. 
The  intersection  of  the  solutions  sets  to  each  clause  Ci  is  the  solution  set  to  the 
conjunction  of  clauses,  Cj  a  C2  a  . . .  a  ,  i.e.,  a  solution  to  SAT.  Hence  an  exponential 

problem  in  n  variables  has  been  reduced  to  a  polynomial  one.  The  important  nature  of 
the  2-D  array  that  makes  this  possible  is  that  identical  strands  behave  in  the  same  manner 
anytime  they  are  applied  and  different  strands  can  be  distinguished  by  their  different 
behavior  on  the  2-D  DNA  array.  (See  Figure  2.) 
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Hybridization 

Hybridized  on  the  2-D  matrix 
that  eontaina  a  library  of  oligoa 
(upto  ~10M7:ono  micro  molo) 
fiKed  on  a  netrocalluioaa 
membrane 


Signature  Analymia 


Signature  Comparison 

Figure  2:  Signature  Comparison  for  SAT  Using  Multiple  2-D  Arrays 


3.3  Construction  of  High  Density  2-D  array  using  SynDCode  Generated  DNA  Codes 

The  DNA  codes  used  below  where  generated  by  SynDCode  [9].  The  first  step  in 
building  the  2-D  matrix  is  to  prepare  the  duplex  DNA  code  to  be  used.  A  70-nucleotide 
long  single- stranded  (ss)  DNA  that  has  40  nucleotides  of  designed  random  sequences  at 
its  5’  end  and  a  20-mer  long  fixed  sequence  at  the  3’  end  served  as  the  primer  site  to 
convert  this  machine  synthesized  ssDNA-code  to  a  dsDNA-code.  The  number  of  the 
different  sequences  in  each  code  is  limited  by  the  DNA  synthesizer  machine.  The  current 
limitation  for  each  synthesis  by  most  DNA  synthesizers  is  about  one  micro  mole  which  is 
about  10  DNA  molecules. 
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On  this  first  dimension  gel  eleetrophoresis,  the  denaturing  strength  inside  this 
polyaery damide  gel  ehanges  eontinuously  from  0%  to  100%  (7  M  urea  plus  40% 
formamide)  top  to  bottom.  Onee  the  eleetrophoresis  started,  the  whole  population  duplex 
DNA  moleeules  begin  to  eneounter  some  low  degree  of  denaturing  strength  (from  urea, 
and  formamide).  Those  dsDNA  moleeules  that  have  lowest  melting  temperature  start  to 
denature,  and  those  have  higher  melting  temperature  remain  as  duplex  DNAs,  and  will  be 
denatured  later.  Any  duplex  DNA  moleeule  that  has  been  eompletely  denatured  would 
beeome  two  single-stranded  moleeules,  thus  run  faster  than  duplex  DNAs  under  the 
denaturing  eondition.  (See  Figure  3A.) 

Next,  the  strip  of  the  first  dimension  gel  with  the  denatured  DNA  is  eut  out,  and 
laid  on  the  top  of  a  seeond  gel.  As  shown  in  Figure  6,  the  DNAs  in  the  first  gel  are  now 
separated  aeeording  to  their  melting  temperatures,  and  those  moleeules  that  have  roughly 
the  same  melting  temperature  are  now  migrated  to  about  the  same  position  in  the  first 
dimension  of  eleetrophoresis.  The  seeond  gel  is  a  native  gel.  Here,  instead  of  denaturing 
the  DNA;  the  seeondary  struetures  of  the  ssDNAs  that  have  about  the  same  melting 
temperature  are  maintained.  In  this  seeond  gel,  the  ssDNAs  are  separated  aeeording  to 
their  seeondary  struetures  formed  by  the  intra-moleeular  hydrogen  bonds.  The  DNA 
moleeules  will  be  further  separated  on  the  seeond  gel  resulting  in  a  smear  orthogonal  to 
the  original  eleetrophoresis  direetion  in  the  2-D  spaee  on  the  seeond  gel.  This  is  the 
seeond  dimension  separation.  (See  Figure  3B.) 

The  seeond  gel  then  is  blotted  on  to  a  nitroeellulose  membrane.  This  is  2-D 
matrix.  The  membrane  ean  then  be  used  as  the  universal  array  for  deteetion  of  any  input 
DNA.  Figure  3A  demonstrates  the  melting  temperature  dimension  and  Figure  3B 
demonstrates  the  seeondary  strueture  dimension. 


««h  bnraMino  dMiiiwIno 
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Ttml 


2-D  Increasing  Pore  Sizes 
(20  to  10  %  Polyacrydamide; 
non-denaturing  Condition) 


Molecules  run  at  the  same 
position  in  the  first  dimension 
would  have  the  same  Melting 
temperature,  but  will  form 
different  secondary 
structures  under  native 
conditon  of  the  second  gel 


A. 


B. 


Figure  3:  Separation  of  DNA  Code  on  2-D  array 
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4.0  RESULTS  &  DISCUSSION 


4.1  Mathematical  Computation  with  Labeled  Nucleotides 

To  implement  the  use  of  modified  nueleotides  into  a  logie  problem,  eaeh 
modifieation  represents  a  variable  (e.g.,  “p”)  that  satisfies  two  eonditions.  The  presenee 
of  the  modifieation  is  taken  as  the  true  eondition  (p)  while  its  absenee  is  taken  as  the  false 
eondition  (p’).  For  example,  assign  the  DNA  modified  by  Alexa  Fluor-488  and  biotin 
with  the  variables  p  and  q,  respeetively.  If  a  DNA  moleeule  is  labeled  with  the  Alexa 
Fluor-488  modifieation,  eonsider  the  moleeule  to  be  “True”  at  variable  p.  If  it  is  labeled 
with  biotin,  eonsider  the  moleeule  to  be  “True”  at  variable  q.  The  AND  logieal  operator 
is  satisfied  when  the  DNA  moleeule  eontains  all  variables  that  satisfy  the  speeified  elause 
(e.g.,  DNA  labeled  with  both  biotin  and  Alexa  Fluor-488  satisfies  p  AND  q).  The  OR 
logieal  operator  is  satisfied  when  the  DNA  eontains  either  one  or  both  variables  that 
satisfy  the  speeified  elause  (e.g.,  DNA  labeled  with  either  biotin  or  Alexa  Fluor-488  or 
both  modifieations  satisfies  p  OR  q). 


Sequences  that  satisl\  p  OR  q: 
pq'r.  pq‘r‘.  p'qr. 
p'qr'.  pqr.  pqr' 


Figure  4:  Steps  in  Computing  p  OR  q 
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In  Figure  4,  labeled  DNAs  (‘True’)  are  represented  by  p,  q  and  r.  Unlabeled  DNAs 
(‘False’)  are  represented  by  p’,  q’,  r’.  Eaeh  Boolean  variable  represents  a  distinet 
modifieation  ineorporated  into  the  DNA.  This  figure  represents  a  seheme  for  the  isolation 
of  the  labeled  DNAs  (bound  to  the  binding  proteins)  although  this  approaeh  eould 
alternatively  be  used  for  retention  of  unbound  material. 


Modified  nueleotides  are  separated  using  speeifie  binding  proteins  linked  to 
magnetie  beads  to  yield  a  solution  to  a  SAT  elause.  (See  Figure  4.)  The  speeifie 
modifieations  of  interest  bind  to  the  beads  (bound  fraetion)  and  are  separated  from  other 
moleeules  using  a  small  magnet.  Unlabeled  DNAs  (unbound  fraetion)  are  removed  from 
the  tube,  leaving  only  sequenees  that  satisfy  the  speeified  elause.  Thus,  we  ean  isolate 
either  the  moleeules  with  the  modifieation  or  those  without  it  allowing  one  modifieation 
to  represent  both  p  and  p’.  Solution  is  eonfirmed  through  deteetion  of  the  presenee  or 
absenee  of  the  modifieation  on  the  DNA  via  dot  blot  [8].  (See  Figure  5.) 
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Figure  5:  Dot  Blots  Results 


In  Figure  5,  PCR  products  and  controls  labeled  with  specific  modified  nucleotides 
(AF  =  Alexa  Fluor-488,  BO  =  BODIPY-FL,  DIG  =  digoxigenin,  B  =  biotin,  +  =  positive 
control)  were  dotted  onto  nylon  membrane  and  treated  with  UV  light  to  crosslink  the 
DNA  permanently  to  the  membrane.  Membranes  were  incubated  with  specific  antibodies 
or  binding  proteins  (a)  anti-DIG,  b)  anti-BODIPY,  c)  anti-  Alex  Fluor,  and  d)  streptavidin 
(binds  biotin))  to  identify  the  modified  products.  Binding  proteins  were  then  localized 
either  using  alkaline  phosphatase  (a,  b,  &  c)  or  horse  radish  peroxidase  as  described  in 
the  methods.  Dark  spots  on  the  membranes  indicate  the  presence  of  antibodies  or  binding 
proteins  at  that  location.  Binding  proteins  recognized  only  specific  modifications  and  not 
other  unrelated  compounds. 

4.2  Using  SynDCode  to  Test  2-D  Array 

Using  a  software  upgrade  of  SynDCode  [9]  that  designed  strands  to  fit  a  desired 
DNA  melting  temperature  gradient,  we  performed  experiments  to  demonstrate  that  the 
denaturing  gradient  gel  electrophoresis  can  separate  duplex  DNAs  of  same  length 
according  to  their  melting  temperatures  and  this  method  is  very  precise  and  reproducible. 
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Figure  6:  Verification  of  Temperature  Dimension  Separation 

Initially  demonstrated  was  that  the  denaturing  gradient  gel  eleetrophoresis 
system  separated  eight  designed  40-bp  duplex  DNA  markers.  The  results  indieate  the 
duplex  DNAs  with  the  same  length  ean  be  separated  aeeording  to  their  melting 
temperatures.  Repeated  experiments  indieate  that  the  duplexes  with  lower  melting 
temperatures  always  migrate  fastest  in  the  denaturing.  The  orders  of  the  migrations  of 
the  duplexes  were  very  reprodueible.  (See  Figure  6 A.) 

To  further  eharaeterize  the  denaturing  gradient  gel  eleetrophoresis  system,  Figure 
6B  shows  the  separation  of  15  SynDCode  designed  duplex  DNAs  aeeording  to  their 
melting  temperatures.  The  first  dimension  denaturing  gradient  gel  system  eannot  separate 
those  DNA  moleeules  with  similar  melting  temperatures,  beeause  the  prineiple  of  the 
separation  is  based  on  the  faet  that  eaeh  duplex  DNA  of  same  length  should  have  a 
unique  melting  temperature  and  would  be  melted  at  its  unique  melting  temperature. 
Therefore,  in  the  last  lane  of  Figure  6B,  when  we  mixt  all  the  15  samples  in  one  lane,  we 
eannot  deteet  1 5  different  bands  on  the  gel,  beeause  several  of  them  have  almost  identieal 
melting  temperatures  as  shown  in  Figure  7.  Instead  we  ean  observe  eight  distinet  bands 
representing  the  different  groups  of  melting  temperatures  in  the  1 5  samples  (indieated  by 
eight  different  eolors  in  Figure  7).  However  eaeh  band  has  the  seeond,  seeondary 
strueture  to  distinguish  sequenees.  This  is  why  the  seeond  dimension  is  important. 
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Figure  7:  SynDCode  Test  Strands 

To  test  the  gradient  denaturing  gel  system  in  separating  large  number  of  different 
DNAs  with  different  sequenees,  we  used  several  SynDCode  duplex  DNA  eodes  of  sizes 
to  2^^.  We  ran  these  samples  side  by  side  in  the  gradient  denaturing  gel  system.  As 
expeeted,  the  results  indieate  as  the  number  of  different  sequenees  in  the  eode  inereased, 
the  DNAs  in  the  eode  are  separated  in  an  inereasing  area  and  quite  evenly  in  the  gel, 
indieated  by  a  long  smear  marked  by  the  red  reetangles  (See  Figure  8.) 
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Figure  8:  Large  Code  Dispersion 


4.3  Obtaining  a  Dense  2-D  Array 

Figure  3  diagrams  the  idea  of  the  2-D  matrix  and  Figure  7  diagrams  the  proeess  of 
making  it  more  dense  and  applying  it  to  the  output  of  the  modified  nueleotide  aqueous 
eomputing  system.  Our  experiments  have  shown  that  the  DNA  in  this  system  behaves 
eonsistently  and  reprodueibly.  Using  eight  distinet  50-mer  DNA  eodes,  a  total  separation 
of  50-mers  on  a  nylon  membrane  was  aehieved.  Figure  9  and  ref  [10]  show  that 

r\  1 

error- free  50-mer  eodes  of  size  2  exist  and  that  SynDCode  ean  find  them.  In  Figure  9, 
extrapolation  of  line  el  indieates  that  50-mers  are  suffieient  for  a  eode  of  size  2^\ 


9.78 


Figure  9:  Sufficient  Strand  Length  for  Error  -Free  Codes  of  Given  Sizes 
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r\  1 

To  make  the  dense  2-D  matrix,  eight  distinet  DNA  eodes  of  size  2  were 
separated  in  the  denaturing  gradient  gel  system  as  shown  in  Figure  8.  After  staining  the 
gel  (with  ethidium  bromide  or  other  reagents),  we  then  eut  the  area  eontaining  the  DNAs 
from  the  gel.  Next,  the  eut  out  gel  strips  are  aligned  and  transferred  to  a  eommereially 
available  membrane  sueh  as  super  eharged  nylon  from  Sehleieher&  Sehuell  by  standard 
blotting  methods  (indieated  by  the  red  box  in  Figure  lOA).  The  resulting  nylon  membrane 
now  has  the  DNA  eodes  attaehed  on  the  surfaee  ean  be  used  to  probe  any  radioaetive  or 
fluoreseent  labeled  input  DNA. 


Figure  10:  Making  and  Using  a  Dense  Array 
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To  verify  the  separation,  this  nylon  membrane  was  probed  with  3  different 
samples  having  inereasing  eomplexity.  In  the  first  experiment,  the  2-D  matrix  was  probed 
by  radioaetive  probes  using  (^X  174  DNA  (~  4  kb)  as  template.  As  shown  in  Figure  1 1  A, 
several  spots  in  the  2-D  matrix  indieate  hybridization.  After  stripping  off  the  first  sample, 
the  same  2-D  matrix  was  probed  by  probes  made  from  X  DNA  (~40  kb).  It  is  elear  that  in 
this  ease  more  areas  of  the  2-D  matrix  show  hybridization  (Figure  1  IB).  To  show  that  the 
DNA  moleeules  from  the  8  libraries  are  spread  in  the  nylon  membrane,  we  probed  the 
membrane  with  radioaetive  labeled  all  possible  20-mers  (4'^20  different  sequenees).  As 
shown  in  Figure  IIC,  the  radioaetive  20-mer  random  sequenees  hybridized  to  nearly  the 
entire  2-D  matrix  spaee  indieating  that  the  2-D  matrix  indeed  eontains  duplex  DNA 
moleeules  eovering  the  whole  2-D  spaee. 

(A)  (B)  |r  ir  ^  (C) 

Ir  .'.  - 

Probed  by  rEKfloacOve  Probed  by  radioactive  Probed  by  radk)acttve 

^174  DNA  XDNA  random  »>4ner  DNA 

Figure  11:  Verification  of  Dense  Array  Construction 
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5.0  CONCLUSION 


In  regard  to  labeled  nueleotides  to  perform  eomputations,  the  experiments 
demonstrate  that  DNA  ean  be  sueeessfully  modified  to  eontain  a  variety  of  fluoreseent 
and  non-fluoreseent  labels  and  that  these  modifieations  ean  be  used  to  do  eomputation. 
The  results  presented  here  allow  for  the  ereation  of  a  four  variable  library  with  labeling  of 
only  four,  as  opposed  to  eight,  sites.  We  expeet  to  be  able  to  seale-up  this  approaeh  sinee 
there  are  at  least  12  fluoreseent  and  3  non-fluoreseent  modifieations  with  eommereially 
available  binding  proteins.  With  all  of  these  ehemieal  eompounds  ineorporated,  either  as 
modified  primers  or  nueleotides,  we  would  be  able  to  solve  15-variable  problems.  Our 
work  shows  that  when  as  more  labels  beeome  available,  more  eomputing  power  ean  be 
obtained. 

In  regard  to  the  DNA  array  eonstruetion,  a  DNA  eode  of  total  size  10^  50-mers 
were  distributed  on  a  two  dimensional  array.  This  exeeeds  the  eurrent  10^  eapaeity  of 
photolithographie  or  ink-jet  arrays  and  will  allow  for  the  solution  of  a  21  variable  SAT 
problem.  This  array  eould  potentially  have  a  1 0  eapaeity,  allowing  for  the  solution  of  a 
50  variable  SAT. 

Different  sequenees  end  up  at  different  loeations  in  this  2-D  array.  The 
distribution  of  different  sequenees  is  self-direeted,  systematie  and  eontinuous.  The  2-D 
matrix  is  universal  in  its  applieation  and  easy  to  make.  A  eolleetion  of  fixed  sequenees 
ean  be  added  to  the  2-D  matrix  to  generate  a  grid  inside  the  2-D  matrix  for  enhaneed 
applieations,  however  the  2-D  matrix  is  self  ealibrating,  beeause  it  seeks  to  aehieve 
reeognizable  global  patterns  and  signal-proeessing  tools  ean  extraet  information. 
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5.1  Other  Applications 


One  bio-defense  applieation  is  that  the  2-D  genomie  matrix  system  proposed  here  is 
eapable  of  eapturing  a  snapshot  of  an  entire  biosystem's  state.  At  a  later  time,  the 
snapshot  ean  be  updated,  or  a  new  one  aequired,  and  eompared  to  a  previous  memory  to 
measure  ehanges  sueh  as  the  appearanee  of  pathogens.  For  example,  this  tool  eould  be 
used  for  monitoring  for  bioterrorism  agents  in  water  supplies.  The  goal  is  to  use  genomie 
information  at  the  population  or  eommunity  seale  to  monitor  and  deteet  the  existenee  of 
new  biota  (sueh  as  pathogens)  in  the  environment.  The  seope  of  organisms  with  their 
genomie  DNA  sequeneed  is  fairly  small.  Thus,  mueh  information  at  the  genomie  level  is 
not  available  with  eonventional  teehniques.  The  2-D  genomie  matrix  provides  a  way  to 
aeeess  information  from  all  organisms  in  a  eommunity  and  a  way  to  assess  impaet  by 
human  and  non-human  biomaterials.  The  2-D  genomie  matrix  does  not  require  explieit 
sequenee  knowledge  and  it  is  quiek,  flexible,  and  inexpensive  to  implement.  Thus,  it 
eould  provide  a  better  diagnostie  tool  for  environmental  monitoring  that  provides  a 
holistie  view  of  the  genomie  status  of  an  eeosystem. 
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